PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Cotton_A_23417_BGI-A2_v1.0
Common Name F383_13763
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1723 aa    MW: 188596 Da    pI: 5.9961
Description MYB family protein
Gene Model
Gene Model ID               Type    Source
Cotton_A_23417_BGI-A2_v1.0  genome  BGI
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  23.6   1.2e-07  816    857   3          46
                                 SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
             Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                                 +WT +E e++ d  + +G++ ++++a+ +  ++t  +c+++++k
  Cotton_A_23417_BGI-A2_v1.0 816 PWTSQEKEIFMDKLAAFGKD-FRKVASFLD-HKTTADCVEFYYK 857
                                 8*****************99.*********.***********98 PP

2    Myb_DNA-binding  33.2   1.2e-10  1034   1074  3          45
                                  SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
             Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                                   WT eE   +++av  +G++ ++ I+r++g +R++ qck ++ 
  Cotton_A_23417_BGI-A2_v1.0 1034 HWTDEEKSAFLQAVSSYGKD-FDMISRYVG-TRSRDQCKVFFS 1074
                                  6*****************99.*********.********8776 PP
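
The Start and End columns above are 1-based, inclusive positions in the protein sequence. As a minimal sketch of how those coordinates map onto the sequence, the snippet below slices the first domain (816-857) out of an excerpt covering residues 781-900 and compares it with the query row of the alignment after gap removal. The names `slice_region`, `EXCERPT` and `ALIGNED_QUERY` are illustrative and not part of PlantTFDB.

```python
# Residues 781-900 of the protein, copied from the Sequence section of this record.
EXCERPT_START = 781  # 1-based position of the first residue in the excerpt
EXCERPT = (
    "ALILDEKEKKVSRFISSNGLVEDPCAIEKERALINPWTSQEKEIFMDKLAAFGKDFRKVA"
    "SFLDHKTTADCVEFYYKNHKSECFEKTKKNDLSKQQGKSAVNTYLLTSGKKRGRELNAAS"
)

# Query row of the first alignment; '-' marks a gap relative to the HMM.
ALIGNED_QUERY = "PWTSQEKEIFMDKLAAFGKD-FRKVASFLD-HKTTADCVEFYYK"


def slice_region(excerpt, excerpt_start, start, end):
    """Return residues start..end (1-based, inclusive) from an excerpt."""
    lo = start - excerpt_start          # convert to a 0-based offset
    hi = end - excerpt_start + 1        # inclusive end -> exclusive slice bound
    if lo < 0 or hi > len(excerpt):
        raise ValueError("region lies outside the excerpt")
    return excerpt[lo:hi]


domain = slice_region(EXCERPT, EXCERPT_START, 816, 857)
ungapped_query = ALIGNED_QUERY.replace("-", "")
print(domain == ungapped_query, len(domain))
```

The same helper applies to the second domain (1034-1074) if an excerpt covering those positions is supplied.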

Protein Features
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          6.13E-13  800    861   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.74     812    863   IPR017884    SANT domain
SMART            SM00717           9.9E-7    813    861   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.7E-4    816    857   IPR009057    Homeodomain-like
PROSITE profile  PS51293           12.744    1030   1081  IPR017884    SANT domain
SMART            SM00717           4.2E-9    1031   1079  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          5.38E-11  1032   1081  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.0E-6    1033   1075  IPR009057    Homeodomain-like
Pfam             PF00249           1.4E-8    1034   1074  IPR001005    SANT/Myb domain
CDD              cd00167           7.92E-8   1035   1073  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1723 aa
MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG NGYVPSRSSN KILDDENFRQ LDSRVDGKYS RNSRENNRGS  120
YSQRDWRGHS WENCNGSPST PGRPHHVNNE RRSVDDMPTY LSHTHSDFVN TWDQLQKSQH  180
DNKTIAVNGL GTGQKCQSEN LVGSIDWKPL KWTRSGSLSS RGSGFSHSSS SKSLGGVDSG  240
EGKLESQQKN LTPVQSPSGD AAACVTSPAP SDETSSRKKP RLAWGEGLAK YEKKKVEGPD  300
TSIDRAGAKI SVRNTEFNNS LSSNLADKSP RVLGFSDCAS PATPSSVACS SSPGVEEKSF  360
GKAANVDNDT SNLCGSPTLG SQNHLEGPSF NLEKLDINSI INMGSSLTNL LQADDPCTVD  420
SSFVRSTAIS KLLLWKSDVL KALEMTESEI DSLENELKLL KGDSRSRCPC PATSSSFPVE  480
EHGKACGEQE AASSQIPRHA PLQIDACGGV LVEKQPLCNG VLEEVNDDVK DGDIDSPGTA  540
TSKFMEPLSL EKAVSPSDVV KFHECSGDFG TVQLMSMGKV ILATGSGNEG TATTISAEGS  600
VLKRIDNDAH VPESSNSDVG GENVMYEMIL ATNKELANIA SEVFNKLLPK DQYNAEISEI  660
GNVACTESDS AIREKIAIRK QYLRFKERVL TIKFKAFQNA WKEDLRSPLM RKYRAKSQKK  720
YEFSLRSTHG GYQKHRSSIH SRFTFPGNPI LEPSVEMMNF TSKLLLGSHG RLYRNAMKMP  780
ALILDEKEKK VSRFISSNGL VEDPCAIEKE RALINPWTSQ EKEIFMDKLA AFGKDFRKVA  840
SFLDHKTTAD CVEFYYKNHK SECFEKTKKN DLSKQQGKSA VNTYLLTSGK KRGRELNAAS  900
LDVLGAASVI AAHAESGMRN RHTSGRILLR GRFDSKRSQL DDSIAERSSN FDIVGSDQDT  960
VAADVLAGIC GSFSSEAMSS CITSSADPGE GYHHDWKCHK VDSVVKRPST SDVLQNVDGD  1020
TCSDESCGEM DSSHWTDEEK SAFLQAVSSY GKDFDMISRY VGTRSRDQCK VFFSKARKCL  1080
GLDLIHSRTR NMGTPMSDDA NGGETDTEDA CVQESSVVCS EKLGSKVEED LPSTIVSMNV  1140
DESDLTREAN LQSDHNISEG NIERLVDHKD SVAAEVNFSN VDQTEPISEC GAGDMDVDSN  1200
QAESLHVQNN VALANLSALE NHVAEEGVSG AVSASHRGTG DCHPSLDASV EPKSGAAALS  1260
TEGFGNNLEA QETLSSKNVM DVRDTRCNAE IGSQVICRPD LDKSSGESID KNSCLDFSFS  1320
SEGLHQVPLD LGSAGKPSIL LFPNENFSAK NSASHSDASQ CEKICNQDRL SVTLAYQGNE  1380
DKQPNNAVSG HEPEHLSGKP SVDLAELQIS TLKEMDIDIG HCQLPEVKRL STSEKGVTGS  1440
YLVQDFLQKC NGPKSPSEFP QLVQNLEQAN SRPKFHSRSL SDTEKPCRNG NVKLFGQILN  1500
SSSQDDGKVR FPEQSMKSSN LNFRGYNNVD GNASFSKFDQ NIIFAPENVP RRSYGFWDGN  1560
RIQTGLSSLP DSEILVAKYP AAFVNYPASS SQMQLQASQS IVRNTDRNMN GVSVFTPREI  1620
SSSNGVMDYQ VYGGHDCTKV VPFAMDMKRR EMFSEMQRRN GFDAISNLQH QGRGMVGMNV  1680
VGTGVGGVVG GSCPNLSDPV AVLRMQYAKT EQYGGQSGSI MRE
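
The listing above groups residues in blocks of ten and ends each full line with a running residue count. A minimal sketch of turning that layout into one contiguous string follows, assuming whitespace-separated tokens and purely numeric counters; `clean_sequence_block` is a hypothetical helper name, and only the first two lines of the listing are used as sample input.

```python
def clean_sequence_block(block):
    """Join a numbered, space-grouped sequence listing into one string."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the running residue count at the end of a line
            residues.append(token)
    return "".join(residues)


# First two lines of the protein sequence listing from this record.
BLOCK = """\
MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG NGYVPSRSSN KILDDENFRQ LDSRVDGKYS RNSRENNRGS  120
"""

seq = clean_sequence_block(BLOCK)
print(len(seq))
```

Run on the complete listing, the resulting length can be compared against the stated 1723 aa as a quick integrity check.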
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4a69_D  8e-17   774          865        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_C  8e-17   774          865        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
Annotation -- Protein
Source  Hit ID                 E-value  Description
Refseq  XP_012454303.1         0.0      PREDICTED: uncharacterized protein LOC105776280 isoform X2
TrEMBL  A0A0B0N7R8             0.0      A0A0B0N7R8_G
STRING  VIT_13s0019g04010.t01  0.0      (Vitis vinifera)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5260              27           44
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein